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WO 98/40520 PCTYUS98/04919 

METHOD FOR SEQUENCING OF MODIFIED NUCLEIC ACIDS 
USING ELECTROSPRAY IONIZATION - 
FOURIER TRANSFORM MASS SPECTROMETRY 

Field of the Invention 

The invention relates to the determination of nucleotide sequences for 
nucleic acids and their analogs. 

Brief Summary of the Related Art 

Determination of nucleotide sequences for various nucleic acids has become 
a fundamentally important analytical step for numerous molecular biology and 
biomedical applications. Consequently, a variety of methods have been developed 
to facilitate such nucleotide sequence determinations. Maxam and Gilbert, Proc. 
Natl. Acad. Sci. USA 74: 5460 (1977) discloses a chemical degradation approach for 
DNA sequence determination. Sanger et al, Proc. Natl. Acad. Sci. USA 74: 5463 
(1977), discloses a chain termination method using complementary strand primer 
extension to determine DNA sequences. Each of these methods utilizes four 
separate reaction mixtures to create a nested set of fragments differing by a single 
nucleotide in length and representing a complete nucleotide sequence, followed by 
resolution of the fragments based on their size to determine the order of the 
fragments and hence the nucleotide sequence. Both of these procedures take from 
numerous hours to days to perform, and neither is effective for determining the 
nucleotide sequence for certain analogs of DNA. 

These popular approaches to nucleotide sequence determination are limited 
by their time consuming methodologies and by their inapplicability to certain 
types of nucleic acid analytes. For example, small synthetic oligonucleotides have 
recently become of interest as tools in molecular biology experiments, as well as 
for use in the antisense therapeutic approach to disease treatment. Correct 
sequences are necessary to the efficacy and safety of such oligonucleotides, and 
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effective and rapid analytical approaches are needed for quality control. This class 
of compounds presents three special problems for traditional sequence 
determination approaches. First, quality control procedures are needed which are 
more rapid than the traditional approaches. Second, the oligonucleotides are 
generally short, often in the range of from about 15 to about 35 nucleotides in 
length. As a consequence, traditional sequencing approaches result in the loss of 
much sequence information and thus provide only incomplete information about 
the nucleotide sequence of the oligonucleotides. Finally, many of these 
oligonucleotides have either modified internucleoside linkages or substitution at 
the 2' position of the ribose to improve their properties as potential therapeutic 
agents. U.S. Patent No. 5,220,007 discloses chimeric oligonucleotides having 
regions of oligonucleoside phosphodiester or phosphorothioate alongside regions 
of oligonucleoside alkylphosphonate or phosphor amidate. PCT publication 
WO94/02498 discloses hybrid oligonucleotides having DNA regions alongside T- 
substituted RNA regions. Uhlman and Peyman, Chemical Reviews 90: 544 (1990), 
discloses oligonucleotides having a variety of modifications along the 
internucleoside linkages, sugar residues or nucleoside bases. 

Recently, some approaches have been developed to address these special 
problems. U.S. Patent No. 5,403,709 discloses a method for sequencing 
oligonucleotides using another oligonucleotide as an extension and a third, 
bridging oligonucleotide to hold the first two together for ligation. Conventional 
primer extension is then used to create a complement for sequencing. This 
approach requires some advance knowledge of a portion of the sequence of the 
analyte oligonucleotide. U.S. Patent No. 5,525,470 discloses a similar approach 
which avoids the need for such advance knowledge by utilizing RNA ligase to 
couple the analyte and extension oligonucleotides. Both of these approaches can 
be used with some modified oligonucleotides, especially with those having 
phosphodiester or phosphorothioate internucleotide linkages. Both, however, are 
time-consuming and limited to analyte oligonucleotides which can act as templates 
for the polymerase enzyme used to synthesize their complement. 
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Recently, Nordhoff et al, J. Mass Spec. 30: 99 (1995) disclosed a direct 
sequencing method for oligonucleotides using infrared matrix-assisted laser 
desorption/ ionization mass spectrometry. This method utilized time-of -flight 
analysis of prompt fragment ions generated by infrared laser 
desorption /ionization. Fragments were reported to be generated by consecutive 
cleavage of the deoxyribose phosphate backbone at apurinic sites. Unfortunately, 
such cleavage does not occur at T residues, due to failure to remove the thymidine 
base, and may not occur in RNA due to stabilization of the glycosidic bond. In 
addition, there is a problem with the prompt fragment ions overlapping with 
fragment ions (PSD ions) from metastable decays occurring in the field-free region 
of the time-of -flight analyzer. 

Brown and Lennon, Anal. Chem. 67: 3990 (1995) discloses sequence-specific 
fragmentation of matrix-assisted laser-desorbed protein/peptide ions and detection 
of the fragments using time-of -flight mass spectrometry with delayed pulsed ion 
extraction. The delayed pulsed ion extraction is used to reduce the generation of 
PSD ions by expanding the desorbed neutral plume during the extraction delay 
period, thereby avoiding energetic collisions believed to play a role in the 
generation of PSD. The technique was found to be applicable to small peptides 
and in one special case to a larger protein. 

Little et ah, J. A. Chem. Soc. 116 : 4893 (1994) disclosed a sequencing method 
for oligonucleotides using electrospray ionization coupled with Fourier-transform 
mass spectrometry for oligonucleotides. Fragments were generated using nozzle 
skimmer dissociation to provide sequence information. The technique was found 
to be applicable to oligonucleotides from 8 bases to as large as 25 bases. 

There remains a need for new sequencing approaches that address the 
special sequence determination problems presented by synthetic oligonucleotides. 
Ideally, such a new approach should be rapid and universal, should act directly 
on the nucleic acid analyte, and should be applicable to any chemically modified 
oligonucleotides. 
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BRIEF SUMMARY OF THE INVENTION 



The invention provides a universal analytical method for determining the 
nucleotide sequence of nucleic acid analytes, including any chemically modified 
oligonucleotides. This new method utilizes electrospray ionization-Fourier 
transform ion cyclotron resonance mass spectrometry (abbreviated as ESI-FT- 
ICRMS or ESI-FTMS). This method is extremely rapid and acts directly on the 
oligonucleotide. It is effective for a variety of nucleic acid analytes, in particular 
for any chemically modified oligonucleotides which have not previously been 
successfully sequenced. 

The invention provides a method for determining the nucleotide sequence 
of nucleic acid analytes comprising providing a suitable target nucleotide sequence 
for ionization by electrospray ionization; exciting the ions of the target; 
fragmenting the target; and determining the nucleotide sequence of the target by 
measuring the mass of the resultant fragments. The target ions can be excited 
and fragmented by any of the known techniques. In one particularly preferred 
embodiment the target ions are excited by sustained resonance excitation (SORI) 
and subsequently fragmented by collisionally activated dissociation (CAD). 
Preferably, the collisionally activated dissociation is by a neutral gas. In another 
preferred embodiment, the target ions are excited and fragmented by nozzle spray 
dissociation (NS). 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a schematic drawing of the BioApex FTMS Vacuum System which 
FTMS system was used in the method according to the invention. 

Figure 2 shows an ESI-FTMS mass spectrum of a 25-mer phosphorothioate 
oligodeoxynucleotide having the sequence 5'- 

CTCTCGCACCCATCTCTCTCCTTCT-3'. The mono-oxygenated 25-mer is also 
detected, as shown in the inset. 

Figure 3, panels A-E, show the ESI-FTMS spectra of a 18-mer phosphorothioate 
oligodeoxynucleotide having the sequence S'-AAAAAAAAAAAAAAAAAT-S' 
obtained by SORI CAD. As shown, the entire sequence can be determined by 
tandem Mass Spectrometry (MS /MS). In panels A-E the following ion series have 
been identified, respectively, (A) a-B ions; (B) singly charged w ions; (C) doubly 
charged w ions; (D) triply and quadruply charged w ions; (E) M-n B ions. 

Figure 4 shows the ESI-FTMS spectra of an 18-mer phosphorothioate 
oligodeoxynucleotide having the sequence S'-AAAAAAAAAAAAAAAAAT-S' 
obtained by nozzle spray dissociation. Only a partial sequence was obtained. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



The invention relates to the determination of nucleotide sequences for 
nucleic acids and their analogs. The patents and publications cited herein are 
known to those skilled in this field and are hereby incorporated by reference in 
their entirety. 

The invention provides an analytical method for determining the nucleotide 
sequence of nucleic acid analytes, including chemically modified oligonucleotides. 
This new method utilizes electrospray ionization-Fourier transform ion cyclotron 
resonance mass spectrometry. This method is extremely rapid and acts directly on 
the nucleic acid analyte. It is effective for a variety of nucleic acid analytes, 
including any chemically modified oligonucleotides which have not previously 
been successfully sequenced. 

The invention provides a method for determining the nucleotide sequence 
of nucleic acid analytes comprising providing a suitable target nucleotide sequence 
for ionization by electrospray ionization; exciting the ions of the target; 
fragmenting the target; and determining the nucleotide sequence of the target by 
measuring the mass of the resultant fragments. The target ions can be excited 
and fragmented by any of the known techniques. In one preferred embodiment 
the ions are excited by sustained resonance excitation (SORI) and fragmented by 
collisionally activated dissociation (CAD). Preferably, the collisionally activated 
dissociation is by a neutral gas. In another preferred embodiment the target ions 
can be excited and fragmented by nozzle spray dissociation (NS). 

The nucleic acid analyte can be a naturally occurring or synthetic 
polynucleotide or oligonucleotide, including oligonucleotides having chemically 
modified internucleoside linkages, sugar backbones or nucleoside bases. For 
purposes of the invention, the term ''oligonucleotide" includes polymers of two or 
more deoxyribonucleoside, ribonucleoside or 2'-0-substituted ribonucleoside 
monomers, or any combination thereof. Preferably, such oligonucleotides will 
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have from about 2 to about 100 monomers, and most preferably from about 8 to 
about 70. Such monomers may be coupled to each other by any of the numerous 
known internucleoside linkages. In certain preferred embodiments, these 
internucleoside linkages may be phosphodiester, phosphotriester, 
phosphorothioate, or phosphoramidate linkages, or combinations thereof. The 
term oligonucleotide also encompasses such polymers having chemically modified 
bases or sugars and/or having additional substituents, including without 
limitation lipophilic groups, intercalating agents, diamines and adamantane. For 
purposes of the invention the term "2 , -0-substituted M means substitution of the 2' 
position of the pentose moiety with an -O-lower alkyl group containing 1-6 
saturated or unsaturated carbon atoms, or with an -O-aryl or allyl group having 2- 
6 carbon atoms, wherein such alkyl, aryl or allyl group may be unsubstituted or 
may be substituted, e.g., with halo, hydroxy, trifluoromethyl, cyano, nitro, acyl, 
acyloxy, alkoxy, carboxyl, carbalkoxyl, or amino groups; or such 2' substitution 
may be with a hydroxy group (to produce a ribonucleoside), an amino or a halo 
group, but not with a 2*-H group. 

The preferred target is a synthetic polynucleotide or oligonucleotide having 
chemically modified internucleoside linkages, sugar backbones or nucleoside 
bases. Preferably, a suitable target will be solubilized in a volatile organic solvent. 
More preferably the volatile solvent includes an alcohol or acetonitrile. In a 
particularly preferred embodiment the volatile solvent is an approximately equal 
ratio mixture of water and organic solvent. Useful alcohols include, without 
limitation, propanol, isopropanol, methanol and ethanol. The volatile solvent may 
also include low concentrations of organic or inorganic bases, for example 
piperidine. 

The targets are ionized by ESI using a voltage differential between the 
needle and the end cap. For example the voltage of the needle can be set at 
ground and the voltage of the end cap can be set at a positive level. The target 
will be negatively ionized when sprayed through the needle and will be attracted 
to the positively charged end cap. Alternatively the voltage of the needle could be 
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set to a negative level, and the target will be attracted to the relatively positively 
charged end cap. In addition, the level of ionization can be manipulated by 
increasing or decreasing the level of pneumatic flow on the needle or by altering 
the differential voltage between the needle and the end cap. 

After the target is ionized by spraying it through the needle towards the 
end cap, the target must be dried. Drying preferably is done at a temperature of 
at least 150°C, and more preferably drying is done at a temperature between 150- 
250°C. Drying also can be in the presence of a gas, for example a neutral gas such 
as carbon dioxide. 

The ESI ionized targets are then transmitted through a capillary, a tube 
lens, and a beam skimmer, guided by a series of three quadrupoles through five 
stages of differential pumping to an open cylindrical ion cyclotron resonance (ICR) 
cell. The ionized target is excited, fragmented and the nucleotide sequence is 
determined. 

When the nucleotide sequence is to be determined by CAD the molecular 
weight of the intact target molecule first can be determined by mass spectrometry. 
The ionized target is then excited and fragmented. For example, excitation can be 
by SORL When using SORI, the level of ion excitation can be manipulated by the 
frequency shift of the excitation and the bursting time of excitation. Preferably the 
ions are excited by single shot excitation. The frequency shift is preferably from 0 
to 2 kHz, and the attenuation should be adjusted to produce a pulse to pulse 
range 

(p-p) in the range of 0 to 10 V. The bursting time is preferably from 0 to 1000 ms, 
more preferably 100 to 400 ms. In one particularly preferred embodiment the 
target ions are excited at a frequency shift of ±650 Hz with attenuation of 35 dB 
(2V p-p) and a bursting time of 300 ms. After SORI excitation the targets are 
fragmented by CAD. Prior to CAD all but the selected ions can be ejected from 
the cell. The excited targets are collided with a neutral gas to fragment the 
molecule. The extent of collisions are controlled to limit the extent of target 
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fragmentation. In one particularly preferred embodiment the collision gas is 
carbon dioxide. The molecular weight of the resultant fragments are then 
determined by mass spectrometry and the nucleotide sequence is determined. 

Excitation and fragmentation also can be done by NS dissociation. For NS 
dissociation the ionized target is dissociated while it is guided from through the 
capillary to the ICR cell. For NS dissociation the voltage differential between the 
capillary exit and the skimmer is preferably between ±0 to ±500 V. In particularly 
preferred embodiments the voltage differential is 140 V. 

The following examples are intended to further illustrate certain preferred 
embodiments of the invention and are not intended to limit the scope of the 
invention. 
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Example 1 
Preparation of an appropriate target 



An 18-mer phosphorothioate oligodeoxynucleotide having the sequence 
S'-AAAAAAAAAAAAAAAAAT^ was prepared as a solution in HPLC grade 
water (Baker) at a concentration of 5000 ppm using the solid phase 
phosphoamidite method on a Beckman Oligo-1000 synthesizer (Fullerton, CA). 
The sample was then treated with cation exchange resin in ammonium form (200- 
400 mesh) to reduce the content of ammonium salt. 
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Example 2 

Sequence determination of a nucleic acid analyte using ESI-FTMS 



The oligodeoxynucleotide target prepared according to Example 1 was 
solubilized in 100:100:1 water: isopropoanokpiperidine and sprayed through a 
needle towards the end cap in the negative ion mode with pneumatic assistance. 
The sequencing was performed in an ESI-FTMS, the 7.0 tesla Bruker Apex 70E 
system (Bruker Analytical Systems, Inc., Billerica, MA) a schematic of which is 
shown in Figure 1. The pressure in the source chamber and the analyzer chamber 
was 4x10 6 torr and 7xl0" 10 torr, respectively. Carbon dioxide was used both for 
drying and sheath flow. The drying gas temperature was at 200°C. 

To sequence the oligonucleotide using the SORI CAD method, the precursor 
ions were activated by single shot excitation with frequency shift of ±650 Hz, 
attenuation of 35 dB (2V p-p), and bursting time of 300 ms. Carbon dioxide was 
used as the collision gas. The voltages were as follows: needle, ground; capillary 
entrance, 4 kV, end cap, 3.5 kV, cylinder, 3.5 kV; skimmer, 

-5 V; capillary exit, -80 V. The analyzed mass spectra are shown in Figures 3A-E. 

To sequence the oligodeoxynucleotide using nozzle skimmer dissociation, 
the voltages were as follows: needle, ground; capillary entrance, 4 kV, end cap, 3.5 
kV, cylinder, 3.5 kV; and the voltage differential between the capillary exit and the 
skimmer was 140 V. The analyzed mass spectra is shown in Figure 4. 
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1. A method for determining the nucleotide sequence of a nucleic acid analyte 
having at least one modified internucleoside linkage, sugar backbone or 
nucleoside base, the method comprising: 

a) providing a suitable target nucleotide sequence for ionization by 
electrospray ionization, such target comprising a nucleic acid analyte 
having at least one modified nucleic acid; 

b) exciting the ions of the target; 

c) fragmenting the target; and 

d) determining the nucleotide sequence of the target. 

2. The method of claim 1 wherein step b further comprises exciting the ions 
by sustained off resonance excitation. 

3. The method of claim 2 wherein said excitation is a single shot excitation. 

4. The method of claim 1 wherein step c further comprises fragmenting the 
target by collisionally activated dissociation. 

5. The method of claim 4 wherein said collisionally activated dissociation is by 
a neutral gas. 

6. The method of claim 5 wherein said neutral gas is carbon dioxide. 

7. The method of claim 1 wherein steps b and c are nozzle skimmer 
dissociation. 
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